Proceedings of the International Congress of Mathematicians 
Hyderabad, India, 2010 



Order and disorder in energy minimization 

Henry Cohn 



Abstract. How can we understand the origins of highly symmetrical objects? One way 
is to characterize them as the solutions of natural optimization problems from discrete 
geometry or physics. In this paper, we explore how to prove that exceptional objects, 
such as regular polytopes or the Eg root system, are optimal solutions to packing and 
potential energy minimization problems. 

Mathematics Subject Classification (2000). Primary 05B40, 52C17; Secondary 
11H31. 

Keywords. Symmetry, potential energy minimization, sphere packing, _E8, Leech lattice, 
regular polytopes, universal optimality. 



1. Introduction 

1.1. Genetics of the regular figures. Symmetry is all around us, both 
in the physical world and in mathematics. Of course, only a few of the many 
possible symmetries are ever actually realized, but we see more of them than 
we seemingly have any right to expect: symmetry is by its very nature delicate, 
and easily disturbed by perturbations. It is no great surprise to see carefully 
designed, symmetrical artifacts, but it is remarkable that nature can ever produce 
similar effects robustly, for example in snowflakes. Any occurrence of symmetry 
not deliberately imposed demands an explanation. 

Laszlo Fejes Toth proposed to seek the origins of symmetry in optimization 
problems. He referred to the genetics of the regular figures^ in which "regular 
arrangements are generated from unarranged, chaotic sets by the ordering effect 
of an economy principle, in the widest sense of the word" |28| . It is not enough 
simply to classify the possible symmetries; we must go further and identify the 
circumstances in which they arise naturally. 

Over the last century mathematicians have made enormous progress in iden- 
tifying possible symmetry groups. We have classified the simple Lie algebras and 
finite simple groups, and although there is much left to learn about group theory 
and representation theory, our collective knowledge is both extensive and broadly 
applicable. Unfortunately, our understanding of the genetics of the regular figures 
lags behind. Much is known, but far more remains to be discovered, and many 
natural questions seem totally intractable. 
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Optimization provides a framework for this problem. How much symmetry and 
order should we expect in the solution of an optimization problem? It is natural to 
guess that the solutions of a highly symmetric problem will inherit the symmetry 
of the problem, but that is not always the case. For a toy example, consider the 
Steiner tree problem for a square, i.e., how to connect all four vertices of a square to 
each other via curves with minimal total length. The most obvious guess connects 
the vertices by an X, which displays all the symmetries of the square, but it is 
suboptimal. Instead, in the optimal solutions the branches meet in threes at 120° 
angles (this is a two-dimensional analogue of the behavior of soap films): 



Note that the symmetry of the square is broken in each individual solution, but of 
course the set of both solutions retains the full symmetry group. 

It is tempting to use symmetry to help solve problems, or at least to guess the 
answers, but as the Steiner tree example shows, this approach can be misleading. 
One of the most famous mistaken cases was the Kelvin conjecture on how to divide 
three-dimensional space into infinitely many equal volumes with minimal surface 
area between them, to create a foam of soap bubbles. In 1887 Kelvin conjec- 
tured a simple, symmetrical solution, obtained by deforming a tiling of space with 
truncated octahedra. (The deformation slightly curves the hexagonal facets into 
monkey saddles, so that the foam has the appropriate dihedral angles.) Kelvin's 
conjecture stood unchallenged for more than a century, but in 1994 Weaire and 
Phelan found a superior solution with two irregular types of bubble^ [54]. This 
shows the danger of relying too much on symmetry: sometimes it is a crucial clue 
as to the true optimum, but sometimes it leads in the wrong direction. 

In many cases the symmetries that are broken are as interesting as the symme- 
tries that are preserved. For example, crystals preserve some of the translational 
symmetries of space, but they dramatically break rotational symmetry, as well as 
most translational symmetries. This symmetry breaking is remarkable, because 
it entails long-range coordination: somehow widely separated pieces of the crystal 
nevertheless align perfectly with each other. A complete theory of crystal for- 
mation must therefore deal with how this coordination could come about. Here, 
however, we will focus on optimization problems and their solutions, rather than 
on the physical or algorithmic processes that might lead to these solutions. 

1.2. Exceptional symmetry: Eg and the Leech lattice. Certain 
mathematical objects, such as the icosahedron, have always fascinated mathemati- 
cians with their elegance and symmetry. These objects stand out as extraordinary 

^Their foam structure was the inspiration for the Beijing National Aquatics Center, used in 
the 2008 Olympics. 
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and have inspired much deep mathematics (see, for example, Fehx Klein's Lectures 
on the Icosahedron [34]). They are the sorts of objects one hopes to characterize 
and understand via the genetics of the regular figures. 

These objects are often exceptional cases in classification theorems. In many 
different branches of mathematics, highly structured or symmetric objects can be 
classified into several regular, predictable families together with a handful of ex- 
ceptions, such as the exceptional Lie algebras or sporadic finite simple groups. 
For most applications, the infinite families play the leading role, and one might 
be tempted to dismiss the exceptional cases as aberrations of limited importance, 
specific to individual problems. Instead, although they are indeed peculiar, the 
exceptional cases are not merely isolated examples, but rather recurring themes 
throughout mathematics, with the same exceptions occurring in seemingly unre- 
lated problems. This phenomenon has not yet been fully understood, although 
much is known about particular cases. 

For example, ADE classifications (i.e., simply-laced Dynkin diagrams) occur in 
many different mathematical areas, including finite subgroups of the rotation group 
50(3), representations of quivers of finite type, certain singularities of algebraic 
hypersurfaces, and simple critical points of multivariate functions. In each case, 
there are two infinite families, denoted An and Z3„, and three exceptions -Eg, -Ey, 
and -Eg , with each type naturally described by a certain Dynkin diagram. See |31j 
for a survey. This means Eg, for example, has a definite meaning in each of these 
problems. For example, among rotation groups it corresponds to the icosahedral 
group, and among simple critical points of functions from R" to M it corresponds 
to the behavior of at the origin. 

In this survey, we focus primarily on two exceptional structures, namely the Eg 
root lattice in and the Leech lattice in R^^. These objects bring together numer- 
ous mathematical topics, including sphere packings, finite simple groups, combina- 
torial and spherical designs, error-correcting codes, lattices and quadratic forms, 
mathematical physics, harmonic analysis, and even hyperbolic and Lorentzian ge- 
ometry. They are far too rich and well connected to do justice to here; see [21] 
for a much longer account as well as numerous references. Here, we will examine 
how to characterize Eg and the Leech lattice, as well as some of their relatives, 
by optimization problems. These objects are special because they solve not just a 
single problem, but rather a broad range of problems. This level of breadth and 
robustness helps explain the widespread occurrences of these structures within 
mathematics. At the same time, it highlights the importance of understanding 
which problems have extraordinarily symmetric solutions and which do not. 

1.3. Energy minimization. Much of physics is based on the idea of en- 
ergy minimization, which will play a crucial role in this article. In many systems 
energy dissipates through forces such as friction, or more generally through heat 
exchange with the environment. Exact energy minimization will occur only at zero 
temperature; at positive temperature, a system in contact with a heat bath (a vast 
reservoir at a constant temperature, and with effectively infinite heat capacity) 
will equilibrate to the temperature of the heat bath, and its energy will fluctuate 
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randomly, with its expected value increasing as the temperature increases. 

One can describe the behavior of such a system mathematically using Gibbs 
measures, which are certain probability distributions on its states. For simplicity, 
imagine a system with n different states numbered 1 through n, where state i 
has energy Ei. For each possible expected value E for energy, the corresponding 
Gibbs measure is the maximal entropy probability measure constrained to have 
expected energy E. In other words, it assigns probability pi to state i so that 
the entropy J2^=i ~Pi^'^SPi maximized subject to J2^=iPi^i — ^- (For the 
motivation behind the definition of entropy, see |33j.) 

A Lagrange multiplier argument shows that when min^ i?.; < E < maxi Ei, 
the probability Pi must equal e~'^^VX]j=i e"''^^ for some constant /3, where /3 is 
chosen so that the expected energy equals E. In physics terms, /3 is proportional 
to the reciprocal of temperature, and only nonnegative values of /3 are relevant 
(because energy is usually not bounded above, as it is in this toy model). As the 
temperature tends to infinity, /3 tends to zero and the system will be equidistributed 
among all states. As the temperature tends to zero, (3 tends to infinity, and the 
system will remain in its ground states, i.e., those with the lowest possible energy. 

In this article, we will focus on systems of point particles interacting via a pair 
potential function. In other words, the energy of the system is the sum over all pairs 
of particles of some function depending only on the relative position of the pair 
(typically the distance between them). For example, in classical electrostatics, it is 
common to study identical charged particles interacting via the Coulomb potential, 
i.e., with potential energy 1/r for a pair of particles at distance r. 

Many other mathematical problems can be recast in this form, even sometimes 
in ways that are not immediately apparent. For a beautiful although tangential 
example, consider the distribution of eigenvalues for a random nxn unitary matrix, 
chosen with respect to the Haar measure on U (n) . These eigenvalues are unit 
complex numbers zi, . . . , z^, and the Weyl integral formula says that the induced 
probability measure on them has density proportional to 



(see [2l!)- If we define the logarithmic potential — log \ zi — Zj\ between Zi and zj, 
then this measure is the Gibbs measure with /3 = 2 for n particles on the unit circle. 
The logarithmic potential is natural because it is a harmonic function on the plane 
(much as the Coulomb potential x i— >■ l/|x| is harmonic in three dimensions). Thus, 
the eigenvalues of a random unitary matrix repel each other through harmonic 
interactions, and the Weyl integral formula specifies the temperature 1/(3. 

In the following survey, we will focus on the case of zero temperature. In the 
real world, all systems have positive temperature, which raises important questions 
about dynamics and phase transitions. However, for the purposes of understanding 
the role of symmetry, zero temperature is a crucial case. 



n I 




l<i<j<~n 



1.4. Packing and information theory. The prototypical packing prob- 
lem is sphere packing: how can one arrange non-overlapping, congruent balls in 
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Euclidean space to fill as large a fraction of space as possible? The fraction of 
space filled is the density. Of course, it must be defined by a limiting process, by 
looking at the fraction of a large ball or cube that can be covered. 

Packing problems fit naturally into the energy minimization framework via 
hard-core potentials, which are potentials that are infinite up to a certain radius 
r and zero at or beyond it. In other words, there is an infinite energy penalty 
for points that are too close together, but otherwise there is no effect. Under 
such a potential function, a collection of particles has finite energy if and only if 
the particles are positioned at the centers of non-overlapping balls of radius r/2. 
Note that every packing (not just the densest) minimizes energy, but knowing the 
minimal energy for all densities solves the packing problem. 

From this perspective, one can formulate questions that are even deeper than 
densest packing questions. For example, at any fixed density, one can ask for a 
random packing at that density (i.e., a sample from the Gibbs measure at zero 
temperature). For which densities is there long-range order, i.e., nontrivial corre- 
lations between distant particles? In two or three dimensions, the densest packings 
are crystalline, and there appears to be considerable order even below the max- 
imal density, with a phase transition between order and disorder as the density 
decreases. (See [41] and the references cited therein for more details.) It is far 
from clear what happens in high dimensions, and the densest packings might be 
disordered [FT] . 

Packings of less than maximal density are of great importance for modeling 
granular materials, because most such materials will be somewhat loose. The 
fact that long-range order seemingly persists over a range of densities means it 
can potentially be observed in the real world, where even under high pressure no 
packing is ever truly perfect. (Of course, for realistic models there are many other 
important refinements, such as variation in particle sizes and shapes.) 

In addition to being models for granular materials, packings play an important 
role in information theory, as error-correcting codes for noisy communication chan- 
nels. Suppose, for a simplified example, that we wish to communicate by radio. 
We can measure the signal strength at n different frequencies and represent it as 
an n-dimensional vector. Note that n may be quite large, so high-dimensional 
packings are especially important here. The power required to transmit a signal 
X G will be proportional to jxp, so we must restrict our attention to signals 
that lie within a ball of radius r centered at the origin, where r depends on the 
power level of our transmitter. 

If we transmit a signal, then the received signal will be slightly perturbed due 
to noise. We can measure the noise level of the channel by e, so that when x is 
transmitted, with high probability the received signal x' will satisfy |a; — < £. 
In other words, if the open balls of radius e about signals x and y do not overlap, 
then with high probability the received signals x' and y' cannot be confused. 

To ensure error-free communication, we will rely on a restricted vocabulary of 
possible signals that cannot be confused with each other (i.e., an error-correcting 
code). That means they must be the centers of non-overlapping balls of radius e. 
For efficient communication, we wish to maximize the number of signals available 
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Figure 1. Two layers in a three-dimensional sphere packing, one denoted by shaded circles 
and the other by unshaded circles. Notice that the unshaded layer sits above half of the 
holes in the shaded layer. 



for use, i.e., the number of such balls whose centers lie within a ball of radius r. 
In the limit as r/e tends to infinity, that is the sphere packing problem. 

1.5. Outline. The remainder of the paper is organized as follows. Sections [2] 
and [3] survey packing and energy minimization problems in more depth. Sections [4] 
and [5] outline the proofs that certain exceptional objects solve these problems. 
Finally, Section |6] offers areas for future investigation. 



2. Packings and codes 

2.1. Sphere packing in low and high dimensions. One can study 
the sphere packing problem in any dimension. In it is trivial, because the line 
can be completely covered with intervals. In M? , it is easy to guess that a hexagonal 
arrangement of circles is optimal, with each circle tangent to six others, but giving a 
rigorous proof of optimality is not completely straightforward and was first achieved 
in 1892 by Thue [50] (see [29] for a short, modern proof). In R'^, the usual way 
oranges are stacked in grocery stores is optimal, but the proof is extraordinarily 
difficult. Hales completed a proof in 1998, with a lengthy combination of human 
reasoning and computer calculations [SQ^- One conceptual difficulty is that the 
solution is not at all unique in R'^. In a technical sense, it is not unique in any 
dimension (even up to isometrics), because density is a global property that is 
unchanged by, for example, removing a ball. However, in three dimensions there 
is a much deeper sort of non-uniqueness. One can form an optimal packing by 
stacking hexagonal layers, with each layer nestled into the gaps in the layer beneath 
it. As shown in Figured] the holes in a hexagonal lattice consist of two translates 
of the original lattice, and the next layer will sit above one of these two translates. 
For each layer, a binary choice must be made, and there are uncountably many 
ways to make these choices. (Each will be isometric to countably many others, 
but there remain uncountably many geometrically distinct packings, with many 
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different symmetry groups.) All these packings are equally dense and perfectly 
natural. See [22] for a discussion of this issue in higher dimensions. 

In four or more dimensions, no sharp density bounds are known. Instead, we 
merely have upper and lower bounds, which differ by a substantial factor. For 
example, in R^^, the best upper bound known is more than 58 times the density of 
the best packing known |16) . This factor grows exponentially with the dimension: 
the best lower bound known is a constant times n2~^ in R" (see [7] and [52]). 
while the upper bound is (1.514724 . . . + o(l))-" (see [32]). 

It may be surprising that these densities are so low. One way to think about it 
is in terms of volume growth in high dimensions. An e-neighborhood of a ball in 
R" has volume (1 + e)" times that of the ball, so when n is large, there is far more 
volume near the surface of the ball than actually inside it. In low-dimensional 
sphere packings, most volume is contained within the balls, with a narrow fringe 
of gaps between them. In high-dimensional packings, the gaps occupy far more 
volume. 

It is easy to prove a lower bound of 2~" for the sphere packing density in M". 
In fact, this lower bound holds for every saturated packing (i.e., one in which there 
is no room for any additional spheres): 

Lemma 2.1. Every saturated sphere packing in R" has density at least 2~". 

Proof. Suppose the packing uses spheres of radius r. No point in space can be 
distance 2r or further from the nearest sphere center, since otherwise there would 
be room to center another sphere of radius r at that point. This means we can 
cover space completely by doubling the radius of each sphere. Doubling the radius 
multiplies the volume by 2", and hence multiplies the density by at most 2" (in fact, 
exactly 2" if we count overlaps with multiplicity). Because the enlarged spheres 
cover all of space, the original spheres must cover at least a 2~" fraction. □ 

This argument sounds highly constructive (simply add more spheres to a pack- 
ing until it becomes saturated), and indeed it is constructive in the logical sense. 
However, in practice it offers almost no insight into what dense packings look like, 
because it is difficult even to tell whether a high-dimensional packing is saturated. 

In fact, it is completely unclear how to construct dense packings in high dimen- 
sions. One might expect the sphere packing problem to have a simple, uniform 
solution that would work in all dimensions. Instead, each dimension has its own 
charming idiosyncrasies, as we will see in Section [2^ There is little hope of a sys- 
tematic solution to the sphere packing problem in all dimensions. Even achieving 
density 2~" through a simple, explicit construction is an unsolved problem. 

2.2. Lattices and periodic packings. The simplest sorts of packings 
are lattice packings. Recall that a lattice in R" is the integral span of a basis (i.e., 
it is a grid, possibly skewed). To form a sphere packing, one can center a sphere at 
each lattice point. The radius should be half the minimal distance between lattice 
points, so that the nearest spheres are tangent to each other. 

There is no reason to expect that lattice packings should be the densest sphere 
packings, and they are probably not optimal in sufficiently high dimensions (for 
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example, ten dimensions). However, lattices are very likely optimal in M" for n < 9 
and for some higher values of n (including 12, 16, and 24). See [24] for more details 
about lattices and packings in general. 

For n < 8 and n — 24, the lattice packing problem has been solved in R". 
In fact, the densest lattices are unique in these dimensions (up to scaling and 
isometrics), although that may not be true in every dimension, such as n = 25. 
For n < 8, the optimal lattices are all root lattices, the famous lattices that arise in 
Lie theory and are classified by Dynkin diagrams. Specifically, the densest lattices 
are Ai (the integer lattice), A2 (the hexagonal lattice), A3 (the face-centered cubic 
lattice, which is also isomorphic to -D3), D4, D^, Eq, Ej, and Eg- For n = 24, the 
Leech lattice is an optimal lattice packing; the proof will be discussed in Section [SJ 

The Dn lattices are particularly simple, because they are formed by a checker- 
board construction as a sublattice of index 2 in Z": 

Dn = {{xi,...,Xn) e Z" : xi + • ■■ + .x„ = (mod 2)}. 

To see why Dn is not optimal in high dimensions, consider the holes in D„, i.e., the 
points in space that are local maxima for distance from the lattice. The integral 
points with odd coordinate sum are obvious candidates, and they are indeed holes, 
at distance 1 from _D„. However, there's a slightly more subtle case, namely the 
point (1/2, 1/2, . . . , 1/2) and its translates by £>„. These points are at distance 

from Dn- When n = 8, this distance becomes ^/2, which is equal to the minimal 
distance between points in Dg. That means these deep holes have become large 
enough that additional spheres can be placed in them. Doing so yields the Eg root 
lattice, whose density is twice that of Dg- (The Eq and E-/ lattices are certain 
cross sections of i^s-) 

The Es and Leech lattices stand out among lattice packings, because all the 
spheres fit beautifully into place in a remarkably dense and symmetric way. There 
is no doubt that they are optimal packings in general, not just among lattices. 
Harmonic analysis ought to provide a proof, but as we will see in Section [5l a full 
proof has been elusive. 

Periodic packings form a broader class of packings than lattice packings. A 
lattice can be viewed as the vertices of a tiling of space with parallelotopes (fun- 
damental domains for the action by translation), but there's no reason to center 
spheres only at the vertices. More generally, one can place them in the interior, 
or elsewhere on the boundary, and then repeat them periodically; such a packing 
is called a periodic packing. Equivalently, the sphere centers in a periodic packing 
form the union of finitely many translates of a lattice. 

The Es packing, as defined above, is clearly periodic (the union of two translates 
of Dg). It is not quite as obvious that it is actually a lattice, but that is easy to 
check. The Leech lattice in R^^ can be defined by a similar, but more elaborate, 
construction involving filling in the holes in a lattice constructed using the binary 
Golay code (see [38] and Section 4.4 in Chapter 4 of [24]). 
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Philosophically, the construction of Es given above is somewhat odd, because 
Eg itself is extraordinarily symmetrical, but the construction is not. Instead, it 
builds Eg in two pieces. This situation is actually quite common when constructing 
a highly symmetric object. By neglecting part of the symmetry group, one can 
decompose the object into simpler pieces, which can each be understood separately. 
However, eventually one must exhibit the extra symmetry. The symmetry group 
of Eg is generated by the reflections in the hyperplanes orthogonal to the minimal 
vectors of Eg, and one can check that it acts transitively on those minimal vectors. 

It is not known whether periodic packings achieve the maximal packing density 
in every dimension. However, they always come arbitrarily close: given any dense 
packing, one can take a large, cubical piece of it and repeat that piece periodically. 
To avoid overlaps, it may be necessary to remove some spheres near the boundary, 
but if the cube is large enough, then the resulting decrease in density will be small. 

By contrast, it is not even known whether there exist saturated lattice packings 
in high dimensions. If not, then lattices cannot achieve more than half the maximal 
density, because one can double the density of a non-saturated lattice by filling in a 
hole together with all its translates by lattice vectors. It seems highly unlikely that 
there are saturated lattices in high dimensions, because a lattice is specified by a 
quadratic number of parameters, while there is an exponential volume of space in 
which holes could appear, so there are not enough degrees of freedom to control all 
the possible holes. However, this argument presumably cannot be made rigorous. 

Despite all the reasons to think lattices are not the best sphere packings in high 
dimensions, the best asymptotic lower bounds known for sphere packing density use 
lattices. Ball's bound 2{n — 1)2^" in R" holds for lattice packings [7], and Vance's 
bound, which improves it by an asymptotic factor of 3/e when n is a multiple of 
four, uses not just lattices, but lattices that are modules over a maximal order 
in the quaternions (55]. Imposing algebraic structure may rule out the densest 
possible packings, but it makes up for that by offering powerful tools for analysis 
and proof. 

2.3. Packing problems in other spaces. Packing problems are inter- 
esting in many metric spaces. The simplest situation is when the ambient space 
is compact, in which case the packing will involve only finitely many balls. The 
packing problem can then be formulated in terms of two different optimization 
problems for a finite subset of the metric space: 

1. What is the largest possible minimal distance between N points? 

2. What is the largest possible size of a subset whose minimal distance is at 
least r? 

The first fixes the number of balls and maximizes their size, while the second fixes 
the radius r/2 of the balls and maximizes the number. In Euclidean space, if we 
interpret the number of points as the number of points per unit volume, then both 
problems are the same by scaling invariance, but that docs not hold in compact 
spaces. The two problems are equivalent, however, in the sense that a complete 
answer to one (for all values of r or N) yields a complete answer to the other. 
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Packing problems arise naturally in many compact metric spaces, including 
spheres, projective spaces, Grassmannians [23l|4], and the Hamming cube {0, 1}" 
(under Hamming distance, so packings are binary error-correcting codes). For a 
simplified example, suppose one wishes to treat a spherical tumor by beaming 
radiation at it. One would like to use multiple beams approaching it from different 
angles, so as to minimize radiation exposure outside of the tumor, and the problem 
of maximizing the angle between the beams is a packing problem in RP^ . 

Packing problems are also important in non-compact spaces, but aside from 
Euclidean space we will not deal with them in this article, because defining density 
becomes much more subtle. See, for example, the foundational work by Bowen and 
Radin on defining packing density in hyperbolic space [TT| . 

Packings on the surface of a sphere are known as spherical codes. Specifically, 
an optimal spherical code is an arrangement of points on a sphere that maximizes 
the minimal distance among configurations of its size. Spherical codes can be used 
as error-correcting codes (for example, in the toy model of radio transmission from 
Section[L4l they are codes for a constant-power transmitter), and they also provide 
an elegant way to help characterize the many interesting spherical configurations 
that arise throughout mathematics. 

One of the most attractive special cases of packing on a sphere is the kissing 
problem. How many non-overlapping unit balls can all be tangent to a central 
unit ball? The points of tangency on the central ball form a spherical code with 
minimal angle at least 60°, and any such code yields a kissing configuration. 

In M^, the kissing number is clearly six, but the answer is already not obvious 
in M.^. The twelve vertices of an icosahedron work, but the tangent balls do not 
touch each other and can slide around. It turns out that there is no room for 
a thirteenth ball, but that was first proved only in 1953 by Schiitte and van der 
Waerden [47] . 

In M^, Musin [52] showed that the kissing number is 24, but the answer is not 
known in M.^ (it appears to be 40). In fact, the only higher dimensions for which the 
kissing problem has been solved are 8 and 24, independently by Levenshtein [39] 
and by Odlyzko and Sloane [43]. The kissing numbers are 240 in and 196560 
in K^"*. Furthermore, these kissing configurations are unique up to isometrics [9]. 

The kissing number of 240 is achieved by the Es root lattice through its 240 min- 
imal vectors. Specifically, there are (2) -2^ = 112 permutations of (±1, ±1, 0, . . . , 0) 
and 2'' — 128 vectors of the form (±1/2, . . . , ±1/2) with an even number of minus 
signs. Thus, Es is not only the densest lattice packing in R®, but it also has the 
highest possible kissing number. Similarly, the Leech lattice in M.'^* achieves the 
kissing number of 196560. 

In general, however, there is no reason to believe that the densest packings will 
also have the highest kissing numbers. The packing density is a global property, 
while the kissing number is purely local and might be maximized in a way that 
cannot be extended to a dense packing. That appears to happen in many dimen- 
sions [24]. Instead of being typical, compatibility between the optimal local and 
global structures is a remarkable occurrence. 
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Figure 2. Energy minimization on an actual (approximate) sphere: tiny, electrically 
charged PMMA beads collecting on the interface between water and cyclohexyl bromide. 
[Courtesy of W. Irvine and P. M. Chaikin, New York University] 

3. The Thomson problem and universal optimality 

3.1. Physics on surfaces. The Thomson problem [49l p. 255] asks for the 
minimal-energy configuration of N classical electrons confined to the unit sphere 
. In other words, the particles interact via the Coulomb potential l/r at Eu- 
clidean distance r. This model was originally intended to describe atoms, before 
quantum mechanics or even the discovery of the nucleus. Thomson hoped it would 
explain the periodic table. Of course, subsequent discoveries have shown that it is a 
woefully inadequate atomic model, but it remains of substantial scientific interest, 
and its variants describe many real- world systems. 

For example, imagine mixing together two immiscible liquids, such as oil and 
water. The oil will break up into tiny droplets, evenly dispersed within the water, 
but they will rapidly coalesce and the oil will separate from the water. Cooks have 
long known that one can prevent this separation by using emulsifiers. One type of 
emulsion is a Pickering emulsion, in which tiny particles collect on the boundaries 
of oil droplets, which prevents coalescence (the particles bounce off each other). 

More generally, colloidal particles often adsorb to the interface between two 
different liquids. See, for example. Figure [2l which shows charged particles made 
of polymethyl methacrylate (i.e., plexiglas) in a mixture of water and cyclohexyl 
bromide. Notice that the particles on the surface of the droplet have spread out into 
a fairly regular arrangement due to their mutual repulsion, and they are repelling 
the remaining particles away from the surface. 

These particles are microscopic, yet large enough that they can accurately be 
described using classical physics. Thus, the generalized Thomson problem is an 
appropriate model. See [12] for more details on these sorts of materials. 

Consider the case of particles on the unit sphere in R". Given a finite subset 
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C C S'^ ^ and a potential function f : (0, 4] — >■ K, define the potential energy by 

x.,y£C 

For each positive integer N and each /, we seek an iV-element subset C C S"'~^ 
that minimizes Ef{C) compared to all other choices of C with \C\ = N. The use of 
squared distance instead of distance is not standard in physics, but it will prove 
mathematically convenient. The function / is defined only on (0, 4] because no 
squared distance larger than 4 can occur on the unit sphere. 

Typically / will be decreasing (so the force is repulsive) and convex. In fact, the 
most natural potential functions to use are the completely monotonic functions, 
i.e., smooth functions satisfying (— 1)'=/('^) > for all integers fc > 0. For example, 
inverse power laws r l/r* (with s > 0) are completely monotonic. 

3.2. Varying the potential function. As we vary the potential function 
/ above, how do the optimal configurations change? From the physics perspective, 
this question appears silly, because the potential is typically determined by funda- 
mental physics. However, from a mathematical perspective it is a critical question, 
because it places the individual optimization problems into a richer context. 

As we vary the potential function, the optimal configurations will vary in some 
family. This family may not be connected, because the optimum may abruptly 
jump as the potential function passes some threshold, and different components 
may have different dimensions [TS]. Nevertheless, we can use the local dimension 
of the family as a crude measure of the complexity of an optimum: we compute 
the dimension of the space of perturbed configurations that minimize energy for 
perturbations of the potential function. Call this dimension the parameter count 
of the configuration. 

Figure [3] (taken from [8 ) shows the parameter counts for the configurations 
minimizing Coulomb energy on with 2 through 64 points. The figure is doubly 
conjectural: in almost all of these cases, no proof is known that the supposed 
optima are truly optimal or that the parameter counts are correct. However, the 
experimental evidence leaves little doubt. 

One can see from Figure |3] that the parameter counts vary wildly. For exam- 
ple, for 43 points there are 21 parameters, while for 44 points there is only 1. 
This suggests that the 44-point optimizer will be substantially simpler and more 
understandable, and indeed it is (see [8j). 

3.3. Universal optimality. When one varies the potential function, the 
simplest case is when the optimal configuration never varies. Call a configuration 
universally optimal if it minimizes energy for all completely monotonic potential 
functions. 

A universal optimum is automatically an optimal spherical code: for the po- 
tential function /(r) = with s large, the energy is asymptotically determined 
by the minimal distance, and minimizing energy requires maximizing the minimal 
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Figure 3. Parameter counts for conjectural Coulomb-energy minimizers on . For 
comparison, the white circles show the dimension of the space of all configurations. 

distance. However, optimal spherical codes are rarely universally optimal. For 
every number of points in every dimension, there exists some optimal code, but 
universal optima appear to be far less common. 

In , there is an A^-point universal optimum for each N, namely the vertices of 
a regular 7V-gon. In S*^, the situation is more complicated. Aside from degenerate 
cases with three or fewer points, there are only three universal optima, namely the 
vertices of a regular tetrahedron, octahedron, or icosahedron [T^ . The cube and 
dodecahedron are not even optimal, let alone universally optimal, since one can 
lower energy by rotating a facet. 

The first case for which there is no universal optimum is five points in S^. 
There are two natural configurations: a triangular bipyramid, with an equilateral 
triangle on the equator together with the north and south poles, and a square 
pyramid, with its top at the north pole and its base slightly below the equator. 
This second family depends on one parameter, the height of the pyramid. The 
triangular bipyramid is known to minimize energy for several inverse power laws 
[48] ■ but it is not even a local minimum when they are sufficiently steep, in which 
case square pyramids seem to become optimal. 

Conjecture 3.1. For every completely monotonic potential function, either the 
triangular bipyramid or a square pyramid minimizes energy among five-point con- 
figurations in . 

For 71 > 4, the universal optima in S*"^^ have not been completely classified. 
Table [T] shows a list of the known cases (proved in [17]). Each of them is a fasci- 
nating mathematical object. For example, the 27 points in correspond to the 
27 lines on a cubic surface. 

The first five lines in the table list the regular polytopes with simplicial facets. 
The next four lines list the Eg root system and certain semiregular polytopes 
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n 


N 


Description 


n 


N <n+l 


regular simplex 


n 


2n 


regular cross polytope 


2 


N 


regular A^-gon 


3 


12 


regular icosahedron 


4 


120 


regular 600-cell 


5 


16 


liemicube 


6 


27 


Sclilafli graph. 


7 


56 


28 equiangular lines 


8 


240 


Es root system 


21 


112 


isotropic subspaces 


21 


162 


strongly regular graph 


22 


100 


Higman-Sims graph 


22 


275 


McLaughlin graph 


22 


891 


isotropic subspaces 


23 


552 


276 equiangular lines 


23 


4600 


iterated kissing configuration 


24 


196560 


Leech lattice minimal vectors 


g('z' + i)/(9 + i) 


(g + l)(g3 + l) 


isotropic subspaces {q is a prime power) 



Table 1. The known A'^-point universal optima in S" 

obtained as cross sections. The next eight lines list the minimal vectors of the 
Leech lattice and certain cross sections. If this were the complete list, it would feel 
reasonable, but the last line is perplexing. It describes another infinite sequence 
of universal optima, constructed from geometries over ¥q in [13j and recognized as 
optimal codes in [40]. How many more such cases remain to be constructed? 

Another puzzling aspect of Table [T] is the gap between 8 and 21 dimensions. 
Are there really no universal optima in these dimensions, aside from the simplices 
and cross polytopes? Or do we simply lack the imagination needed to discover 
them? Extensive computer searches [5] suggest that the table is closer to complete 
than one might expect, but probably not complete. Specifically, there are a 40- 
point configuration in and a 64-point configuration in S-^^ that appear to be 
universally optimal, but these are the only conjectural cases that have been located. 

Almost all of the results tabulated in Tabic [Dean be deduced from the following 
theorem. It generalizes a theorem of Levenshtein 00] , which says that these con- 
figurations are all optimal codes. The one known case not covered by the theorem 
is the regular 600-cell, which requires a different argument |17) . 

To state the theorem, we will need two definitions. A spherical k-design in 
5""^ is a finite subset V of the sphere such that for every polynomial p: M" — )■ M 
of total degree at most k, the average of p over V equals its average over the entire 
sphere. Spherical fc-designs can be thought of as sets giving quadrature rules (i.e., 
numerical integration schemes) that are exact for polynomials of degree up to k. 
An m-distance set is a set for which m distances occur between distinct points. 
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Theorem 3.2 (Cohn and Kumar [17j). Every m-distance set that is a spherical 
(2m — l)-design is universally optimal. 

The proof of this theorem uses hnear programming bounds, which are developed 
in the next section. 



4. Proof techniques: linear programming bounds 

4.1. Constraints on the pair correlation function. In this section, 
we will discuss techniques for proving lower bounds on potential energy. In partic- 
ular, we will develop linear programming bounds and briefly explain how they are 
used to prove Theorem 13.21 

They are called "linear programming bounds" because linear programming can 
be used to optimize them, but no knowledge of linear programming is required to 
understand how the bounds work. They were originally developed by Delsarte for 
discrete problems in coding theory [25], extended to continuous packing problems 
in [26j |32] , and adapted for potential energy minimization by Yudin and his collab- 
orators [55l |35l [36l [U [2] . In this section, we will focus on spherical configurations, 
although the techniques work in much greater generality. 

Given a finite subset C of S"""^, define its distance distribution by 



where (•, •) denotes the inner product in M". In physics terms, A is the pair 
correlation function; it measures how often each pairwise distance occurs (the inner 
product is a natural way to gauge distance on the sphere). Linear programming 
bounds are based on proving certain linear inequalities involving the numbers At. 
These inequalities are crucial because the potential energy can be expressed in 
terms of the distance distribution A by 



since — 2 — 2{x,y). (Although (|4.1[) sums over uncountably many values of 
t, only finitely many of the summands are nonzero.) Energy is a linear function of 
A, and the linear programming bound is the minimum of this function subject to 
the linear constraints on A, which makes it the solution to a linear programming 
problem in infinitely many variables. 

To begin, there are several obvious constraints on the distance distribution. 
Let N ^\C\. Then At > for ah t, At ^ for \t\ > 1, Ai = N, and J2t = ^"^ ■ 

The power of linear programming bounds comes from less obvious constraints. 
For example, -^tt > 0. To see why, notice that 



At^\{ix,y)eC^ : {x,y)^t} 




(4.1) 



x.y^C 



-l<t<l 



2 



Y^Att^ Y,{x,y)= 



X 



> 0. 
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More generally, there is an infinite sequence of polynomials (independent of C, but 
depending on the dimension n) Pj^*, P", P^, . . . , with degP^ = k, such that for 
each k, 



(In fact, we can take P^'{t) = 1, Pi"(t) = t, and P^{t) ^ - l/n.) This inequality 
is nontrivial, because these polynomials are frequently negative. For example, 
looks like this: 



The polynomials P^ are called ultraspherical polynomials, and they are char- 
acterized by orthogonality on the interval [—1, 1] with respect to (1 — i2^(n-3)/2 
In other words, for i ^ j, 



This relationship determines the polynomials up to scaling, as the Gram-Schmidt 
orthogonalization of the monomials l,t,t^, . . . with respect to this inner product. 
The sign of the scaling constant is determined by P^'(l) > 0, and the magnitude 
of the constant is irrelevant for (|4.2[) . 

In fact, these polynomials have a far stronger property than just (j4.2p : they 
are positive-definite kernels. That is, for any N and any points xi, . . . , xm € S'^~^, 
the N X N matrix (P^((a;i, Xj))) ^.^^ is positive semidefinite. This implies (j4.2p 
because the sum of the entries of a positive-semidefinite matrix is nonnegative. 
Schoenberg |45| proved that every continuous positive-definite kernel on S"^^ must 
be a nonnegative linear combination of ultraspherical polynomials. 

4.2. Zonal spherical harmonics. As an illustration of the role of repre- 
sentation theory, in this section we will derive the ultraspherical polynomials as 
zonal spherical harmonics and verify that they satisfy (j4.2l) . The reader who is 
willing to take that on faith can skip the derivation. 

The orthogonal group 0{n) acts on S"~^ by isometrics, and hence i^(S'"~^) is 
a unitary representation of 0{n). To begin, we will decompose this representation 
into irreducibles. Let Vk be the subspace of functions on S*"^^ defined by poly- 
nomials on R" of total degree at most k. We have Pq C Pi C • • • , and each Vk 
is a finite-dimensional representation of 0{n), with [Ji^Pk dense in i^(5"~^). To 
convert this filtration into a direct sum decomposition, let Vq — Po and define Vk 
to be the orthogonal complement of Vb ® Vi ® • • • ® Vk-i within Pk (with respect 




(4.2) 
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to the usual inner product on L'^^S'^ ^)). Then Vk is stih preserved by 0{n), and 
the entire space breaks up as 

fc>0 

(The hat indicates the completion of the algebraic direct sum.) The functions in 
Vk are known as spherical harmonics of degree k, because Vk is an eigenspace of 
the spherical Laplacian, but we will not need that characterization of them. 

For each x € 5""^, evaluating at x defines a linear map / i— > f{x) on Vk- Thus, 
there exists a unique vector Vk^x G Vk such that for all / G 14, 

f{x) = {f,Vk,x), 

where (•, •) denotes the inner product on Vk from L^^S*""^). The map x i-> Vk,x is 
called a reproducing kernel. 

For each T £ 0{n) and f e Vk, 

{f,Vk,Tx) = fiTx) = {T-\f)ix) = {T-^f,Vk,x) - {f,Tvk,x). 

Thus, Tvk,x — Vk,Tx, by the uniqueness of Vk^Tx- It follows that Vk,x is invariant 
under the stabilizer of x in 0{n). In other words, it is invariant under rotations 
about the axis through ±x, so it is effectively a function of only one variable, the 
inner product with x. Such a function is called a zonal spherical harmonic. 
We can define PJ} by 

These polynomials certainly satisfy (|4.2|) . because 



P'kii^^V))^ X! '"kAv)^ X! i'"k,X,Vk,y) 

x,y£C x.yGC x.yGC 



x£C 



>o. 



and in fact they are positive-definite kernels because (^PJ^{{xi,Xj)))^^..^^ is the 
Gram matrix of the vectors Vk,Xi ■ 

The functions uo,x,wi,2:, • ■ • are in orthogonal subspaces, and hence the poly- 
nomials Pq,Pi,... must be orthogonal with respect to the measure on [—1,1] 
obtained by projecting the surface measure of S*""^ onto the axis from —x to 
X. The following simple calculation shows that the measure is proportional to 
(1 — t^)("~'^)/^ dt. Consider the spherical shell defined by 

1 < H Vxl<l + e. 

If we set xi = t, then the remaining coordinates satisfy 

1 - < xl + ■ ■ ■ + xl < 1 - + e, 

and the volume is proportional to (1 — — (1 — <2yn-i)/2^ divide 

by s to normalize, then as e — > we find that the density of the surface measure 
with xi ~ t is proportional to (1 — i2-j(n-3)/2^ desired. 
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The degree of PJ} is at most k, and because Vk,x is orthogonal to Vk-i, the 
degree can be less than k only if PJ} is identically zero. That cannot be the case 
(for n > 1), since otherwise evaluating at x would be identically zero. If it were, 
then it would follow from Tvk^x = Vk,Tx that evaluating at each point is identically 
zero, and thus that Vk is trivial. However, Vk 'Pfc-i, and hence Vjt is nontrivial. 

Thus, the polynomials P'^ defined above have degree fc, satisfy (|4.2p . and have 
the desired orthogonality relationship. 

4.3. Linear programming bounds. Let C c 5"^^ be a finite subset 
and let A be its distance distribution. To make use of the linear constraints on A 
discussed in Section 14.11 we will use the dual linear program. In other words, we 
will take linear combinations of the constraints so as to obtain a lower bound on 
energy. 

We introduce new real variables and /3t specifying which linear combination 
to take. Suppose we add ao times = Q;fc times 

^ A,P,"(t)>0 
-i<t<i 

(with ttfc > for fc > 1), and fit times the constraint > (with /?< > for 
-1 < i < 1). We find that 

-l<t<l k -l<t<l 

using the normalization P'^(t) — 1. Define h(t) — J2k'^kPk {t)- Then 

J2 At {h{t) + Pt) > aoN^ - h{l)N, 
-i<t<i 

because Ai ~ N. If we choose ak and so that h{t) + (3t — f{2 — 2t)/2 for 
— 1 < i < 1, then the energy will be bounded below by aoN'^ — h{l)N, by (|4.ip . 

The equation h{t)+Pt /(2-2i)/2 just means that h{t) < /(2-2t)/2 (because 
we have assumed only that Pt > 0). Thus, we have proved the following bound: 

Theorem 4.1 (Yudin [55]). Suppose h{t) — akP^it) satisfies ak > for k > 
and h{t) < f(2 - 2t)/2 for -l<t<l. Then for every finite subset C C S'"-^, 

Ef{C)>ao\C\^-h{l)\C\. 

To prove Theorem 13.21 one can optimize the choice of the auxiliary function h 
in Theorem 14. II Suppose C is an m-distance set and a spherical {2m — l)-design, 
and / is completely monotonic. In the proof of Theorem 14. 11 equality holds if and 
only if h{t) — f{2 — 2t)/2 for every inner product t < \ that occurs between points 
in C and yec^fe ((^'2^)) — ^ whenever a/c > and fc > 0. The latter equation 
automatically holds for 1 < fc < 2m— 1 because C is a (2m— l)-design. Let h be the 
unique polynomial of degree at most 2m — 1 that agrees with /(2 — 2t) /2 to order 2 
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at each of the m inner products between distinct points in C, so that h satisfies the 
other condition for equahty. The inequahty h{t) < /(2 — 2t)/2 follows easily from 
a remainder theorem for Hermite interpolation (using the complete monotonicity 
of /). The most technical part of the proof is the verification that the coefficients 
ak of h are nonnegative. For any single configuration, it can be checked directly; 
for the general case, see [TTI . 

4.4. Semidefinite programming bounds. Semidefinite programming 
bounds, introduced by Schrijver (46j and generalized by Bachoc and Vallentin 
[S], extend the idea of linear programming bounds by looking at triple (or even 
higher) correlation functions, rather than just pair correlations. Linear constraints 
are naturally replaced with semidefinite constraints, and the resulting bounds can 
be optimized by semidefinite programming. 

This method is a far-reaching generalization of linear programming bounds, 
and it has led to several sharp bounds that could not be obtained previously 
[6l 121). However, the improvement in the bounds when going from pairs to triples 
is often small, while the computational price is high. One of the most interesting 
conceptual questions in this area is the trade-off between higher correlations and 
improved bounds. When studying A^-point configurations in S"^^ using fc-point 
correlation bounds, how large does k need to be to prove a sharp bound? Clearly 
k = N would suffice, and for the cases covered by Theorem l3.2l it is enough to take 
k = 2. Aside from a handful of cases in which fc = 3 works, almost nothing is known 
in between. (Cases with fc > 4 seem too difficult to handle computationally.) This 
question is connected more generally to the strength of LP and SDP hierarchies 
for relaxations of NP-hard combinatorial optimization problems |37| . 

It is also related to a conjecture of Torquato and Stillinger [51], who propose 
that for packings that are disordered (in a certain technical sense), in sufficiently 
high dimensions the two-point constraints are not only necessary but also sufficient 
for the existence of a packing with a given pair correlation function. They show 
that this conjecture would lead to packings of density (1.715527. . . + o(l))^" in 
R", by exhibiting the corresponding pair correlation functions. The problem of 
finding a hypothetical pair correlation function that maximizes the packing density, 
subject to the two-point constraints, is dual to the problem of optimizing the linear 
programming bounds. 



5. Euclidean space 

5.1. Linear programming bounds in Euclidean space. Linear 
programming bounds can also be applied to packing and energy minimization 
problems in Euclidean space, with Fourier analysis taking the role played by the 
ultraspherical polynomials in the spherical case. In this section, we will focus pri- 
marily on packing, before commenting on energy minimization at the end. The 
theory is formally analogous to that in compact spaces, but the resulting opti- 
mization problems are quite a bit deeper and more subtle, and the most exciting 
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applications of the theory remain conjectures. 

We will normalize the Fourier transform of an function / : M" M by 

fit) ^ f /(:E)e2"<*^"> dx. 

(In this section, / will not denote a potential function.) The fundamental technical 
tool is the Poisson summation formula for a lattice A, which holds for all Schwartz 
functions (i.e., smooth functions all of whose derivatives are rapidly decreasing): 

xeA ^ ' ' teA* 

Here, vol(R"/A) is the volume of a fundamental parallelotope, and A* is the dual 
lattice defined by 

A* = {te M" : (t, x) eZ for all x e A}. 
Given any basis of A, the dual basis with respect to (•, •) is a basis of A*. 

Theorem 5.1 (Cohn and Elkies [IB]). Let f : M" — > R 6e a Schwartz function 
such that /(O) ^ 0. // f{x) < for \x\ > 1 and f{t) > for all t, then the sphere 
packing density in R" is at most 

^"/^ m 

2"(n/2)! ■ /(o)' 

Of course, {n/2)l means T{n/2 + 1) when n is odd. The restriction to Schwartz 
functions can be replaced with milder assumptions [16l [17] . 

The hypotheses and conclusion of Theorem 15.11 are invariant under rotation 
about the origin, so without loss of generality we can symmetrize / and assume 
it is a radial function. Thus, optimizing the bound in Theorem 15.11 amounts to 
optimizing the choice of a function of one (radial) variable. 

It is not hard to prove Theorem 15.11 for the special case of lattice packings. 
Suppose A is a lattice, and rescale so we can assume the minimal vector length is 
1 (i.e., the packing uses balls of radius 1/2). The density is the volume of a sphere 
of radius 1/2, which is 7r"''^/(2"(n/2)!), times the number of spheres occurring 
per unit volume in space. The latter factor is 1/ vol(]R"/A), because there is one 
sphere for each fundamental cell of the lattice, and hence the density equals 

tt"/^ 1 
2"(n/2)! ' vol(M"/A)' 

Now we apply Poisson summation to see that 



^^^"^^ vol(R"/A) 
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The left side is bounded above by /(O), because all the other terms come from 
|a;| > 1 and are thus nonpositive by assumption. The right side is bounded below 
by /(O)/ vol(M"/A), because all the other terms are nonnegative. Thus, 



■'^ ' - vol(M"/A)' 

which is equivalent to the density bound in Theorem 15. II 

The proof in the general case is completely analogous. It suffices to prove 
the bound for all periodic packings (because they come arbitrarily close to the 
maximal density) , and one can apply a version of Poisson summation for summing 
over translates of a lattice. See [16] for the details, as well as for an explanation 
of the analogy between these linear programming bounds and those for compact 
spaces. 

5.2. Apparent optimality of Eg and the Leech lattice. Theo- 
rem [53] does not explain how to choose the function /, and for n > 1 the optimal 
choice of / is unknown. However, one can use numerical methods to optimize the 
density bound, for example by choosing f{x) to be e~'^'^' times a polynomial in 
|a;p (so that the Fourier transform can be easily computed) and then optimizing 
the choice of the polynomial. For 4 < n < 36, the results were collected in Table 3 
of [16], and in each case the bound is the best one known, but they are typically 
nowhere near sharp. For example, when n — 36, the upper bound is roughly 58.2 
times the best packing density known. That was an improvement on the previous 
bound, which was off by a factor of 89.7, but the gap remains enormous. 
However, for ti = 2, 8, or 24, Theorem 15.11 appears to be sharp: 

Conjecture 5.2 (Cohn and Elkies 16 ). For n = 2, 8, or 24, there exists a 
function f that proves a sharp bound in Theorem \5.1\ (for the hexagonal, Eg , or 
Leech lattice, respectively) . 

The strongest numerical evidence comes from [18]: for n = 24 the bound is 
sharp to within a factor of 1 + 1.65 • 10"'^''. Similar accuracy can be obtained for 
n = 8 or n = 2, although only 10~^^ was reported in [TB]. Of course, for ti = 2 the 
sphere packing problem has already been solved, but Coniecturc 15.21 is open. 

This apparent sharpness is analogous to the sharpness of the linear program- 
ming bounds for the kissing number in R^, M®, and M^^. In that problem, it would 
have sufficed to prove any bound less than the answer plus one, because the kissing 
number must be an integer, but the bounds in fact turn out to be exact integers. In 
the case of the sphere packing problem, the analogous exactness is needed (because 
packing density is not quantized), and fortunately it appears to be true. 

Examining the proof of Theorem l5 . II gives simple conditions for when the bound 
can be sharp for a lattice A, analogous to the conditions for Theorem 14. II / must 
vanish at each nonzero point in A and / must vanish at each nonzero point in A*. 
In fact, the same must be true for all rotations of A, so / and / must vanish at 
these radii (even if they are not radial functions). Unfortunately, it seems difficult 
to control the behavior of / and / simultaneously. 
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n 



function order 



coefficient 



conjecture 



8 
8 

24 
24 
8 
8 

24 



9 2 

9 2 

9 2 

9 2 

9 4 

9 4 

9 4 

5 4 



2.7000000000000000000000000 . 
1.5000000000000000000000000 . 
2.6276556776556776556776556 . 
1.3141025641025641025641025 . 
4.2167501240968298210999141 . 
1.2397969070295980026220772 . 
3.8619903167183007758184168 . 
0.7376727789015322303799539 . 



-27/10 
-3/2 
14347/5460 
205/156 



? 



? 



? 



24 



? 



Table 2. Approximate Taylor series coefficients of g and g about 0. 

For the special case of lattices, however, it is possible to complete a proof. 

Theorem 5.3 (Cohn and Kumar [18j). The Leech lattice is the unique densest 
lattice in M^"*, up to scaling and isometrics. 

The proof uses Theorem 15.11 to show that no sphere packing in M^'* can be 
more than slightly denser than the Leech lattice, and that every lattice as dense 
as the Leech lattice must be very close to it. However, the Leech lattice is a 
locally optimal packing among lattices, and the bounds can be made close enough 
to complete the proof. This approach also yields a new proof of optimality and 
uniqueness for £"§ (previously shown in |10| and [53]). 

One noteworthy hint regarding the optimal functions / in and M^"* is an 
observation of Cohn and Miller [20] about the Taylor series coefficients of /. It is 
more convenient to use the rescaled function g{x) — f{rx), where r = %/2 when 
n = 8 and r = 2 when n = 24. Then 17(0) = g{0), and without loss of generality 
let this value be 1. Assuming g is radial, we can view g and g as functions of one 
variable and ask for their Taylor series coefficients. Only even exponents occur 
by radial symmetry, so the first nontrivial terms are quadratic. Cohn and Miller 
noticed that the quadratic coefficients appear to be rational numbers, as shown in 
Table [21 The quartic terms seem more subtle, and it is not clear whether they are 
rational as well. If they are, then their denominators are probably much larger. 

More generally, one can study not just the sphere packing problem, but also 
potential energy minimization in Euclidean space. The total energy of a periodic 
configuration will be infinite, because each distance occurs infinitely many times, 
but one can instead try to minimize the average energy per particle. Some of the 
densest packings minimize more general forms of energy, but others do not, and 
simulations lead to many intriguing structures (19j . 

Cohn and Kumar [17 proved linear programming bounds for energy and made 
a conjecture analogous to Conjecture 15.21 

Conjecture 5.4 (Cohn and Kumar 17 ). For n — 2, 8, or 24, the linear program- 
ming bounds for potential energy minimization in M" are sharp for every completely 
monotonic potential function (for the hexagonal, Eg, or Leech lattice, respectively). 

This universal optimality would be a dramatic strengthening of mere optimality 
as packings. It is not even known in the two-dimensional case. 
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6. Future prospects 

The most pressing question raised by this work is how to prove that the hexagonal 
lattice, Eg, and the Leech lattice are universally optimal in Euclidean space. Linear 
programming bounds reduce this problem to finding certain auxiliary functions of 
one variable, and the optimal functions can even be computed to high precision, 
but so far there is no proof that they truly exist. 

More generally, can we classify the universal optima in a given space? No proof 
is known even that the list of examples in is complete, although it very likely 
is. Each of the known universal optima is such a remarkable mathematical object 
that a classification would be highly desirable: if there are any others out there, 
we ought to find them. 

One noteworthy case is equiangular line configurations in complex space. Do 
there exist unit vectors xi, . . . , x^^ G C" such that for i 7^ j, \{xi,Xj)\'^ is 
independent of i and j (in which case one can show it must be l/(n + 1))? In 
other words, the complex lines through these vectors are equidistant under the 
Fubini-Study metric in CP"~^. Zauner [56] conjectured that the answer is yes for 
all n, and substantial numerical evidence supports that conjecture 44j, but only 
finitely many cases have been proved. A collection of vectors with this property 
gives an n^-point universal optimum in CP"^^, by Theorem 8.2 in 17^. This case 
is particularly unusual, because normally the difficulty is in proving optimality for 
a configuration that has already been constructed, rather than constructing one 
that has already been proved optimal (should it exist). 

These equiangular line configurations are in fact closely analogous to Hadamard 
matrices. They can be characterized as exactly the simplices in CP"~^ that are 
projective 2-designs (where a simplex is simply a set of points for which all pairwise 
distances are equal). Similarly, Hadamard designs, which are an equivalent variant 
of Hadamard matrices [3^ , are symmetric block 2-designs that are simplices under 
the Hamming distance between blocks. The existence of Hadamard matrices of 
all orders divisible by four is a famous unsolved problem in combinatorics, and 
perhaps the problem of equiangular lines in C" will be equally difficult. 

These two problems are finely balanced between order and disorder. Any 
Hadamard matrix or equiangular line configuration must have considerable struc- 
ture, but in practice they frequently seem to have just enough structure to be 
tantalizing, without enough to guarantee a clear construction. This contrasts with 
many of the most symmetrical mathematical objects, which are characterized by 
their symmetry groups: once you know the full group and the stabilizer of a point, 
it is often not hard to deduce the structure of the complete object. That seems 
not to be possible in either of these two problems, and it stands as a challenge to 
find techniques that can circumvent this difficulty. 

In conclusion, packing and energy minimization problems exhibit greatly vary- 
ing degrees of symmetry and order in their solutions. In certain cases, the solutions 
are extraordinary mathematical objects such as or the Leech lattice. Sometimes 
this can be proved, and sometimes it comes down to simply stated yet elusive con- 
jectures. In other cases, the solutions may contain defects or involve unexpectedly 
complicated structures. Numerical experiments suggest that this is the default 
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behavior, but it is difficult to predict exactly when or how it will occur. Finally, in 
rare cases there appears to be order of an unusually subtle type, as in the complex 
equiangular line problem, and this type of order remains a mystery. 
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